108 research outputs found

    Opportunistic linked data querying through approximate membership metadata

    Get PDF
    Between URI dereferencing and the SPARQL protocol lies a largely unexplored axis of possible interfaces to Linked Data, each with its own combination of trade-offs. One of these interfaces is Triple Pattern Fragments, which allows clients to execute SPARQL queries against low-cost servers, at the cost of higher bandwidth. Increasing a client's efficiency means lowering the number of requests, which can among others be achieved through additional metadata in responses. We noted that typical SPARQL query evaluations against Triple Pattern Fragments require a significant portion of membership subqueries, which check the presence of a specific triple, rather than a variable pattern. This paper studies the impact of providing approximate membership functions, i.e., Bloom filters and Golomb-coded sets, as extra metadata. In addition to reducing HTTP requests, such functions allow to achieve full result recall earlier when temporarily allowing lower precision. Half of the tested queries from a WatDiv benchmark test set could be executed with up to a third fewer HTTP requests with only marginally higher server cost. Query times, however, did not improve, likely due to slower metadata generation and transfer. This indicates that approximate membership functions can partly improve the client-side query process with minimal impact on the server and its interface

    Substring filtering for low-cost linked data interfaces

    Get PDF
    Recently, Triple Pattern Fragments (TPFS) were introduced as a low-cost server-side interface when high numbers of clients need to evaluate SPARQL queries. Scalability is achieved by moving part of the query execution to the client, at the cost of elevated query times. Since the TPFS interface purposely does not support complex constructs such as SPARQL filters, queries that use them need to be executed mostly on the client, resulting in long execution times. We therefore investigated the impact of adding a literal substring matching feature to the TPFS interface, with the goal of improving query performance while maintaining low server cost. In this paper, we discuss the client/server setup and compare the performance of SPARQL queries on multiple implementations, including Elastic Search and case-insensitive FM-index. Our evaluations indicate that these improvements allow for faster query execution without significantly increasing the load on the server. Offering the substring feature on TPF servers allows users to obtain faster responses for filter-based SPARQL queries. Furthermore, substring matching can be used to support other filters such as complete regular expressions or range queries

    STUDY OF CURRENT APPROACHES FOR WEB PUBLISHING OF OPEN SCIENTIFIC DATA

    Get PDF
    Subject of Study. The subject of study of this work is closely related to the development of tools and technologies for Internet publishing of open data in machine-readable formats with regard to data of universities, educational and research organizations and scientific laboratories. We analyze the trends in the publishing formats most commonly used including not only popular formats such as pdf, csv, excel, but also the Semantic Web formats such as RDF. The paper describes the way of scientific data publication in semantic formats on the example of import and convertation of the information from University database. Methods. We describe the methods of publication for scientific open data in the network consisting of a set of transformations of the original data sets to the final semantic representation. These transformation steps include data upload from a relational database, data mapping on the ontological model (schema) and the generation of a set of RDF-triples corresponding to the initial database fragment. A description is given to the popular open data publishing systems, such as CKAN, VIVO, and others. OpenLink Virtuoso system is selected as the primary storage and data publication. The description of RDF data model is used as a way of presenting open data of ITMO University. Main Results. The authors have described the methods of scientific open data publication and identified their shortcomings. To demonstrate the efficiency of the proposed method of university open data publication, a software prototype has been developed available online at: http://lod.ifmo.ru/. The example of the system usage is also given. Practical Relevance. Implementation of the proposed approach will improve significantly the effect of the publication of university open data and make it available for third-party applications, such as applications for information retrieval about educational activities and research results, analysis of scientific activities in universities and their research departments

    Meta-analysis of non-alcoholic fatty liver disease and electromechanical reconstruction of myocardium

    Get PDF
    The present study aimed to analyze medical databases using various scientific articles; moreover, this experiment analyzed medical databases using published scientific articles related to NAFLD, endovascular treatment, cardiac arrhythmias and conduction disorders, changes in the geometry of atria and ventricles, changes in myocardial mass volume as well as diastolic flow left and right ventricular systolic functions, coronary blood flow, analysis of the dependence of epicardial fat tissue (EFT) thickness, and the presence of chronic heart failure (CHF

    Software for full-color 3D reconstruction of the biological tissues internal structure

    Full text link
    A software for processing sets of full-color images of biological tissue histological sections is developed. We used histological sections obtained by the method of high-precision layer-by-layer grinding of frozen biological tissues. The software allows restoring the image of the tissue for an arbitrary cross-section of the tissue sample. Thus, our method is designed to create a full-color 3D reconstruction of the biological tissue structure. The resolution of 3D reconstruction is determined by the quality of the initial histological sections. The newly developed technology available to us provides a resolution of up to 5 - 10 {\mu}m in three dimensions.Comment: 11 pages, 8 figure

    Spatial Models Developed Using Laser Scanning at Gas Condensate Fields in the Northern Construction-Climatic Zone

    Get PDF
    Wide exploration and industrial exploitation of hydrocarbon fields in Yamal Peninsula pose in front of construction and mining companies critical problems of efficient construction at constantly evolving fields taking into account climatic and geocryological conditions of their location. Yamal Peninsula is characterized by unstable soils, the mobility of which has a substantial impact on the changes in spatial arrangement of field facilities, not only in the direct process of construction, but also during their scale-up and equipment overhaul. The paper examines implementation of 3D spatial arrangement modelling of industrial facilities into the process of construction and installation works at hydrocarbon fields in the northern construction-climatic zone. The purpose of implementing this method combined with 3D spatial modelling of equipment connections lies in reliability and safety enhancement of the facilities throughout their entire lifespan. Authors analyze statement and solution of the problem associated with alignment and installation of prefabricated equipment and pipelines, taking into account advanced technologies of 3D design and modelling. The study examines a 3D spatial model with the elements of equipment connection geometry; the model is related to existing production facilities at the field. Authors perform an analysis and in mathematical terms formulate the problem of optimal spatial arrangement for such models. The paper focuses on typical deviations, occurring in the installation process of constructions and connection facilities, their spatial arrangement is modelled. Possible solutions are offered, as well as an algorithm of their implementation at an operating field

    Extracting novel facts from tables for Knowledge Graph completion

    Get PDF
    We propose a new end-to-end method for extending a Knowledge Graph (KG) from tables. Existing techniques tend to interpret tables by focusing on information that is already in the KG, and therefore tend to extract many redundant facts. Our method aims to find more novel facts. We introduce a new technique for table interpretation based on a scalable graphical model using entity similarities. Our method further disambiguates cell values using KG embeddings as additional ranking method. Other distinctive features are the lack of assumptions about the underlying KG and the enabling of a fine-grained tuning of the precision/recall trade-off of extracted facts. Our experiments show that our approach has a higher recall during the interpretation process than the state-of-the-art, and is more resistant against the bias observed in extracting mostly redundant facts since it produces more novel extractions

    Substantiation of optical criterions of thermal-oxidative stability of lubricating

    Get PDF
    Research results of criteria of thermal-oxidative stability determined by different combinations of coefficient of absorption of light quantity and optical density with coefficient of evaporation and kinematic viscosity of oxygenated oil are presented. It is shown that the amount of optical density and the coefficient of evaporation divided by coefficient of relative viscosity are the most effective criteria of thermal-oxidative stability of lubricating oils described by second order polynomial with a high correlation coefficient. © Published under licence by IOP Publishing Ltd

    РАСПРЕДЕЛЕНИЕ ЭЯКУЛЯТОВ ПО ЧИСЛУ СПЕРМАТОЗОИДОВ У БЫКОВ ГОЛШТИНСКОЙ ПОРОДЫ В ВОЗРАСТЕ 6–7 ЛЕТ

    Get PDF
    The paper highlights the results of the system analysis carried out in order to explore the primary characteristics of native seed from Holstein servicing bulls of Black-and-White breed aged 6-7 years. The analysis was carried out to rank ejaculates by the total number of spermatozoa. The experiment was carried out at OAO “Moskovskoye on pedigree activities” in 2011 - 2013. The researchers tested 5964 ejaculates, obtained from 17 bulls during three years, in relation to the season. The researchers explored the number of ejaculates (n), defect of the native sperm when taking (%), and the seasonal dynamics of the total number of sperm cells in the ejaculate. The keeping technology, feeding and maintenance, sperm collection, its assessment and cryopreservation was carried out in accordance with the requirements of the “National technology of producing and application of servicing bulls’ sperm “. Ejaculates were distributed into three groups according to the total number of spermatozoa: - up to 3 billion; - 3.1 - 5.0 billion; - 5.1 billion and more. The researchers found out that the number of ejaculates of good quality selected for cryopreservation in the mature reproductive aged bulls (6 - 7 years) is 76% on average with a variability of 73-82%, depending on the season and year of maintenance. The highest number of the ejaculates of good quality was obtained in autumn and it was 82.3%, which is 7-10% higher than that in other seasons (p<0.001). The analysis of ejaculates over a three-year period showed that the first group contained the main mass of ejaculates, where the number of spermatozoa was up to 3 billion. This parameter varied within 40-48% from season to season; the researchers observed the ejaculates that have total number of spermatozoa equal to 3.1 - 5.0 billion were 28.5-33.4%, the ejaculates that have the highest number of spermatozoa (5.1 billion and more) were 23-29%.Был проведен системный анализ массивов первичных характеристик нативного семени, полученного от быков-производителей голштинской породы черно-пестрой масти в зрелом репродуктивном возрасте (6–7 лет) для ранжирования эякулятов по общему числу выделенных сперматозоидов. Опыт проводился на базе ОАО «Московское» по племенной работе в 2011–2013 гг. Всего было проанализировано 5964 эякулята, полученных от 17 быков в течение трех лет в зависимости от сезона года. Изучено число эякулятов, брак нативного семени при взятии, сезонная динамика общего числа сперматозоидов в эякуляте. Взятие семени, его оценка и криоконсервация проводились согласно требованиям Национальной технологии получения и использование спермы племенных быков-производителей. Эякуляты были распределены на три группы по общему числу сперматозоидов: до 3 млрд; 3,1–5,0 млрд; 5,1 млрд и выше. Установлено, что у быковпроизводителей зрелого репродуктивного возраста (6–7 лет) количество качественных эякулятов, отобранных для криоконсервации, составляет в среднем 76 % с вариабельностью 73–82 % в зависимости от сезона и года эксплуатации. Наибольшее количество качественных эякулятов было получено в осенний сезон и составило 82,3 %, что на 7–10 % выше по сравнению с остальными сезонами года (Р<0,001). Анализ эякулятов за трехлетний период показал, что основной массив распределен в первой группе, где число сперматозоидов составило до 3 млрд. Данный показатель в зависимости от сезона года варьировал между 40–48 %. Эякуляты, имеющие общее количество сперматозоидов от 3,1 до 5,0 млрд, составили 28,5–33,4 %, эякуляты с наивысшим числом сперматозоидов (5,1 млрд и выше) – 23–29 %
    corecore